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Abstract 

A general statistical thermodynamic theory that considers given sequences of x-integers to play 
the role of particles of known type in an isolated elastic system is proposed. By also considering 
some explicit discrete probability distributions px for natural numbers, we claim that they lead to 
a better understanding of probabilistic laws associated with number theory. Sequences of numbers 
are treated as the size measure of finite sets. By considering px to describe complex phenomena, 
the theory leads to derive a distinct analogous force fx on number sets proportional to at 
an analogous system temperature T. In particular, this yields to an understanding of the uneven 
distribution of integers of random sets in terms of analogous scale invariance and a screened inverse 
square force acting on the significant digits. The theory also allows to establish recursion relations 
to predict sequences of Fibonacci numbers and to give an answer to the interesting theoretical 
question of the appearance of the Benford's law in Fibonacci numbers. A possible relevance to 
prime numbers is also analyzed. 
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I. INTRODUCTION 



There are suggestive parallels between various aspects of number theory and physics phe- 
nomena. The connection often involves affinity with functions arising in statistical mechanics 
For example, a connection between given number sets and the thermodynamic formal- 
ism has been established for the multifractal spectrum via a Legendre transform 0] and the 
statistical mechanics applications for prime numbers via the Riemann partition function ^. 
Bose condensation has been indirectly linked to the zeta function in Q]. Primes as such have 
also attracted recent applications in acoustics and dynamical systems jsj. Furthermore, the 
existence of an imaginary quantum mechanical potential for the prime numbers has been 
idealized in after some energy considerations. In a similar context, additive number the- 
ory in terms of scale invariance has been used to study the origin and form of the uneven 
distribution of integers of many random data sets 

In this paper we introduce an alternative approach which considers sets of x-natural 
numbers to play the role of particles of known type in an isolated elastic system and assume 
that their discrete probability distribution can be split as the product of some functions. By 
applying thermodynamics techniques, we develop an analogous statistical mechanics version 
of the average energy and energy states of arbitrary number sets. 

From the Helmoltz free energy in a reversible isothermal process, a relation for an analo- 
gous tensile force F > is derived by treating x as variations in length in the direction of F 
and considering the number system as a elastic specimen (body). In particular, our theory 
allows to give an alternative physics answer to the uneven distribution of integers in terms 
of a screened inverse square force which ideally may act on these numbers. It also allows 
to establish second order polynomial recursions to predict sequences of integers following 
generalized Fibonacci numbers. We also analyze a possible application of the distinct forces 
to primes. 

Hereafter, the word x-number is used to refer to quantities which are integers and positive. 
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II. UNDERLYING THEORY 



A. A priori probability postulate 

Let us define the discrete probability distribution of finding a certain positive natural 
number x as 

Px = gi{x) g^ix) , (1) 

with gi{x) > and < g2{x) < 1 and satisfying the normalization J2xPx — 1- Such discrete 
probability distribution functions (and also discrete mixtures of them), used to describe 
complex phenomena, are given by the product of two different functions. The real function 
g2{x) here represents the set of digital frequencies which remain fixed under base changes 
given by the real function gi{x). 

This factorization may be seen somewhat arbitrary, however it is based on composed 
functions which include the Binomial(A,B), Furry or Geometrical, inversed Logarithmic, 
Pascal or Polya and Poisson distributions. Furthermore, a probability decomposition into 
(two) functions is no new. This type of factorization appears formally, e.g., in the analysis of 
stochastic processes on graphs. According to the well-known Hammersley-Chfford Theorem, 
a probability function p{x) is formed by a normalized product of positive functions on cliques 
{i.e., fully connected subgraph) of the graph Q when the stochastic process x is Markov 
relative to Q. 

To simplify notation in all later sums and products the variable x represents an upper 
bound (or cutoff). 

B. Thermodynamic Analogy 

Using the above probability postulate the information theoretical entropy Sj — 
— J2x Px in Px then becomes 

Si = -J29i{x) g2{x) {In gi{x) + In g2{x) } . (2) 

X 

In order to make a thermodynamic analogy, let us consider next the Helmholtz free energy 

A = E — TS assuming an analogous temperature T and volume V to be independent vari- 
ables, with E the analogous internal energy of number sets. Because of the close relationship 
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of Eq.Q to the thermodynamic entropy 5", we can readily identify 

A/kT ^ "^p^ In gi{x) , (3) 

X 

E/kT ^ -J2p,lng2{x) . (4) 

X 

with k the Boltzmann constant. Reduced units are used throughout. The choice of the 
latter quantities may be seen quite arbitrary, but as mentioned in the conclusion these can 
be justified in analogy with standard thermodynamics results. 

In this case, for the correct application of conventional thermodynamics the normalization 
of Px is used to define both the Boltzmann and the Shannon entropies. 

For 'non-interacting' number sets x > 1, we can get an approximated statistical mechanics 
version of the average energy as the sum of individual contributions E ^ E = J2x '^x^x- The 
total number of positive natural number x's is given by = Yl,x ^x and ex represents their 
analogous energy states. Thus, from Eqs.dH) and (jlj we further identify 

fix -In g2{x) > , (5) 
€x/kT gi{x) g2{x) = Px > ■ (6) 

The number of states available to a molecular system is given by the partition function 
Q which satisfies the relation S = k In Q{N, V, E) in the microcanonical ensemble and 
relates the canonical partition function by Q{N, V, T) = ^1{N, V, E)e~^f^'^ . Therefore, from 
Eqs.©, Q and (jH)) we also have in analogy 

Qx = \{[9^{x)]-^^ = \{[g,{x)r^''^ . (7) 

X X 

As expected, the form of our canonical partition function depends on the number probability 
distribution of Eq. ((H) . 



C. Analogous Force 

To derive a relation for an analogous external tension which we may ideally apply on 
number sets in a gedanken experiment, let us treat x as a variation in length in the direction 
of a tensile force fx and the number system as an elastic body. As a first approximation, 
the required relation is then obtainable directly from the first and second laws of thermo- 
dynamics in a reversible isothermal process For a change at constant T, it follows that 



dA = dW = f di, where W is the work done on the system by its surroundings in a small 
elongation di. Hence, by partial differentiation of Eq.Q, 

■^"^ (sx)^ gi{x) ( dx ^i*^^) (^x)^ ' 

Since the free energy depends on the upper cutoff x then the derivative was taken in terms 
of that variable, li.e. the derivative of the last term in the sums. If fx < 0, this corresponds 
to an analogous compressive force on the number system. We shall show next that this new 
tensile force may indeed be useful to characterize number sets. 



III. PROBABILITY DISTRIBUTIONS AND RELATED FUNCTIONS 
A. First Digits Patterns: gi{x) = cq , 52 (a;) = fl^n x] 

In terms of the first digits of unbiased random samples from random distributions, the 
so-called Benford's law follows the logarithmic distribution 

Px = logio {1 + ^) = Co In -f{x) , (9) 

with 1/co = In 10, ■y{x) = 1 + ^ and x = 1,2,---. This empirical law deals with the 
distribution of first digits in numbers in which significant positive digits are not uniformly 
distributed, but smaller digits and smaller combinations of significant digits are favored. 
The sum of probabilities in this case is 

X + 1 

Xl^^ = Z^^' = CoXl^^ ^(^) = ^0 n( ) = Coln{l + x) . (10) 

X X X 

This result implies that px is not a proper probability since it imposes some maximum value 

El- 

From Eq. ((Tj) the evaluation of the analogous partition function is straight forward, namely 

= = (l + x)~"o'""« . (11) 

X 

According to Eq.®, our analytical expression for the analogous force acting on this 
number set becomes 

= — . (12) 

Co In Co X + x^ 

It is from this base-invariant behavior of a distinct fx that we can make an attempt to 
understand the significant digit distribution of Eq. in terms of an screened inverse square 
force. 
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B. Relevance to Primes: gi{x) = e ^ , g2{x) = x " 

If we generalize the above to a power law behavior for the sequence of primes only greater 
than one with an exponent cr > 1 , i.e., p^p = xp, then the class of analogous forces 
on prime numbers becomes 

f^. = -^ > (13) 



x: 



p 



-a fingerprint for primes. 

Using Eq.©, the analogous partition function approaches 



l/e ^ n 1 ™— (T 5^ ™cr ■ (1^) 



Another possible relation of our statistical thermodynamic approach to prime numbers 
follows by considering the average system energy. In the limit of Eq. ()12j) . this takes the form 

Co X ^ 

In principle this could be asymptotic to Y.x where the counting function 7r(a;) denotes 
the number of prime numbers Xp that do not exceed x jl^ . 

IV. DISCUSSION 

A. Benford's law, Analogous Force and Thermodynamics 

As an example, in Fig^ we plot the probability distribution and our distinct force on 
(up to 300) numbers from Fibonacci series and compare results with the present analogous 
statistical thermodynamics approach. The theoretical calculation of the analogous force 
is obtained using Eq. p2|) (and its large-x behavior). The numerical calculation of for 
Fibonacci numbers is also rather simple. This is derived using Eq.(jH)) and from the fact that 

is constant. By the two-point central formula we approximate 

/^//nco=^^^ ^]^{Vx+i-Vx~i) ■ (16) 

This means that the probability distribution px is related to an analogous local potential 
ield at each number [i.e., particle) as in the well known diffusion limited aggregation model 
isl ] . Assuming a sort of superposition principle, the total analogous force F = J2x fx can 

also be readily evaluated. From Eq.lfT^ we obtain F/cq = —In 00/7(0;). 



Since cq ~ 0.4343 in logarithm base 10, we then have an analogous system entropy S* > 0, 
the analogous Helmholtz free energy A < 0, the analogous partition function Qx > 0, an 
analogous expansive force on the integers fx > and the sum of probabilities > 0. 
Interestingly, from Eq. ljlOp we also have that this sum only becomes normalized at a; = 9 
{i.e., a decadelong). On the other hand, the increase in the analogous entropy as x increases 
follows the natural tendency of thermodynamic systems to move toward disordered states 
having more microscopic states characterized by our analogous macroscopic variables. 

Besides, the particular 7(x) under consideration allows to derive a measure for an anal- 
ogous inverse temperature of the integers system. From Eqs.Q, (jH) and 0, we thus ap- 
proximate 

^S/k) _ Incp 

A{E/kT) ln{ln^{x)) ' ^ ^ 

where A represents functions difference between consecutive number fluctuations. 

For X >> 1, Eq. ()12|) implies that ~ CqX~^. Such a power law behavior with exponent 
one is the solution of the functional form p^' = Pax = ^aPx which may correspond to an 
analogous scale invariance jlol |. 

The results shown in the figure allows to argue that our simple physics-based approach 
is not merely speculative. Indeed it yields to a different and more complete understanding 
of Benford's law than that of previous attempts ^, Q, U, 3. It includes scale invariance, 
frequencies behavior under base changes and explanation for a logarithmic law distribution 
besides the analogous thermodynamic characterization of given number sets such as time 
series or primes. 



B. Screening 

For X >> 1, the force — — can be approximated as — so we could consider the 

' CO In CO ' 

exponential term as being screening. The presence of exp{—l/x) screening in this analogous 
inverse-squared force implies that the second leading digits {i.e., 10 < x < 99) are much 
more uniformly distributed than the first digits, and the third than the second, and so on 
according to observations This is reflected by the numerical differences /i — /g = 0.17708, 
/lo ~ /gg = 0.00326, /loo — /ggg = 0.00004. The main message is that the asymmetry in favor 
of smaller significant digits or combinations of significant digits can be understood in terms 
of analogous screened inverse square force acting on the number sets. 
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FIG. 1: Probability distribution of, and distinct force on, Fibonacci numbers. 

Indeed screening could be a way to unveil the fact that significant digits are not indepen- 
dent, but that instead knowledge of one digit affects the likelihood of another. Furthermore, 
since screening is a consequence of having the base invariant probability distribution of 
Eq.(|TI), then the choice of digital frequencies bounded byl<l + i<e leads to support a 
logarithmic law distribution. 
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C. Recursions: A Possible Scenario for Number Prediction 



From a physical point of view, the equihbrium of forces within defect-fr ee p late under 
external shear can be approximated by the discrete Laplace equation [ 



iefect-fr ee p i 



J2 GkAuk = (18) 

k=l 

where Auk = Ukii) — u{i) denotes relative displacements of the ith. lattice point with re- 
spect to its nearest neighbors and Gk is the corresponding modulus of rigidity of the bond 
connecting the iih lattice point to its neighbor. 

From Eq. ()18|) it follows then that for a ID system 

= GiMi(z) + 62^2(0 , (19) 

w\iemG, = G,lY.iG, (z = l,2). 

It is not unreasonable to assume that the moduli of rigidity of our analogous elastic 
system are proportional to each other, at least prior to any equivalent breakdown condition 
which may prevail. Therefore if we consider Ui{i) oc {G2/ Gi)u2{i — 1), then we have that all 
displacements M2, ui and u follow generalized fibonacci sequences of the form j3| 

U{i + 2)=U{i)+U{i + l) . (20) 

Within our thermodynamics framework, the sequences of x-integers played the role of 
analogous 'non-interacting' particles. Besides, their magnitude may be seen as representing 
the above effective (increasing) displacements, say expected values x = £\U], at different 
times i. The analogous force may be in turn seen as an applied shear acting on our ID 
isolated elastic system of finite size x, which is being pulled out (or in). 

Therefore following these ideas from fracture phenomena, a possible scenario to deduce 
second order polynomial recursions leading to predict sequences of numbers becomes possi- 
ble. 



V. CONCLUDING REMARKS 



In this paper we have introduced a quite abstract statistical mechanical system whose 
states are represented by natural numbers. By considering some explicit probability dis- 
tributions for the natural numbers, we claim that they lead to a better understanding of 
probabilistic laws associated with number theory. 
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Within this kind of thermodynamic formahsm we have investigated probabihty distri- 
butions which lead to Benford's law. Sequences of numbers have been treated as the size 
measure of finite sets and, in analogy with statistical thermodynamics, they have been as- 
sumed to play the role of particles of known type in an isolated elastic system. 

The relation to physics is rather in the thermodynamical formalism and terminology for 
number theoretical problems. The assignment of statistical quantities here proposed are 
not arbitrary. Our analogies for the considered functions, e.g., Helmoltz free energy, etc, 
follows the similar thermodynamical approach as proposed within Multifractal physics. The 
expression of the Shannon entropy in Eq.Q split into two terms with the identification of 
one of the terms with an analogous free energy and the other with an analogous internal 
energy can be justified (although not rigorously) in terms of the form of our canonical 
partition function which depends on the number probability distribution of Eq.(^ as may 
be expected. Furthermore as derived from such identification, we can argue that the relation 
in Eq. ()17|l tends to unity in the large x-limit in analogy with the standard thermodynamic 

definition T = f |f ) . 

\as ) v,N 

Though this scheme may be seen as rather abstract, we have shown that it leads to very 
interesting connections with statistical thermodynamics and allows to interpret physically 
the observed probability distribution of first digits in terms of a stretching force (tension) 
which ideally may pull integers number sets along analogous lengths. 

The introduction of an analogous force within our formalism has been done on a formal 
level. The new insights gained by such a formal introduction and its utility include, in 
particular, the study of the dynamics of given number sets by establishing second order 
polynomial recursions to predict sequences of generalized Fibonacci numbers. This gives 
an answer to the interesting theoretical question of the appearance of the Benford's law in 
Fibonacci numbers. 

Following recent interest in Benford's law we believe the new force may find appli- 
cation to detect fraud in financial data [2^, by looking for anomalies in their analogous 
macroscopic thermodynamic variables, in the use of logarithmic computers to speed calcu- 



lations 



21[, by analyzing the screening behavior of the analogous inverse square forces on 



floating points, or in the characterization of the hopcount in Internet |22| |. 
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